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anp axqxssodmx ao ^inoxgjTP A T V»nbeaj P«* 'eAXSua^ux aoqBi 
'buxmnsuoo amx* sx sxsAxbub buxpuBq ox*auabo*Ao reuox^uaAUoo 
aaomaai^anj - pa:*aadaa*ux Axxn3 ^ouubo uox^cmaojux 
adA^oAa** asnBoaq ArFTTV* BTWTI 3° sx sxsAxbub ox^auabo^Ao 
Tbuoxww 'sabu«vo ox^aueb aaq^o pub suoxwotsuw axdxrcnm 
W w s adAq.o A jcts^( xaTdmoo ui • (samosomoauo pauxB^s-BsnaxD 
ux puBq b 30 x«p™ an* Axa^enixxoaddB) qw ox ATaWXOJddB ubu.:* 
aabaBi suoxbaa 'jeAaftoy 'pa^xoxi sx sanbxuuoa* ox*auabo*Ao 
5 o uot^tossj aiii -suoxbaa x«mosouioauo pa^axap ao paxjXTdmB 
bux}oa:*ap ao3 pou.*aui tbuot^TP"^ S T sox:*auaboq.Ao 

♦asBasxp 50 sxsoubBxp Axa«a 
ux xnjdian eq ubo sax3TT««"°uqB uons 50 uox^oa^ap T^^uaad 
jo spou.:*aui 'smw, -samoapuAs uox^axap oaoxm ai« ao xz Amosxa} 
sb upns s^uambas amosomoauo 30 uxBb ao ssot moaj TTnsaa 
Ai^uanfaaaj smaxqoad ox^aua6 x**eux:iad 'uoxiTPP* ui 

•uox^oa^ap aonm:; aaxxaBa ^xiaaad pue 'asuodsaa 
oxanadBaay^ 30 uox^BOx^souboad aAoadmx 'asBasxp aoj sxsvq 
TBOx6o T oxq am auxjap 0} syojja awxxxoBj ubo uoxssaaboad 
^uanbasqns pup uoxq.Buiao3SUBa:v ox^sBxdoau o^ buxpBax s^uaAa 
ox^auab am 30 uox^Boxjx^uapi -sauab aossaaddns aoranq. 30 
uox^bax^obux ao sauaboouo 30 uox:*bax*ob ux buxrrnsaa saouanbas 
YNu 30 ssot ao uxb6 am. aAXOAUX saxouBUbxxBiu Aubui 'axdurexa aoj 
•asBasxp 30 uox^oa^ap pub Apn^s au^ aoj suox^Bxndod \\bo buomB 
uoxssaadxa ao abBSop auaB ux saouaaa33TP 3° uox}BOX3T}uapx 
au* 03 paq.oaaxp aas saxpn^s ox^auab pub oxuiouab Aubw 

•saouanbas 30 suox^-oaxxoo 

aamo ux saouanbas asam. 30 aaqmnu Adoo au^ aAX^Bxaa saouanbas 
30 uox^oaiTOO b ux saouanbas pxob oxaxonu asxnox^aBd 30 
saaqnmu Adoo buxaBdmoo aos spoq^am uoxaBZxpxaqAu pxoB oxaxonu 
30 asn am 0} sa^sxaa *x '«inoTWBd ui -sasBasxp snoxaBA 
tttx* pa^BXOossB sax^xxeuiaouqB ox^auab buxddBiu pus bux^oa^ap 
ao3 spoyq.aui oq. sa^Bxaa uox^uaAUX ^uasaad aqx 
NOIXN3ANI 3HI JO ONflOHOHDYS 

^OllVZIQtaaAM HDNHJS'Hao Al^ HAllvaVdWOJ 
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*YNO 3sai ay, moJ , T *u6 T s jaM&Ti, ATaAT^BTaj 
«=M S tTT* sitae «a, ma ux JBqnnu A d os UT psseB „ UT uaaq 

^ZlllT BUOT6 " a ' MOU36 JO suot6 « »«• °> P«">«o= 

U «" ™» »"=» «l* «°*J Taub TS „„p T X T s At „ t . a 
«°<|s TTT „ STT3D aw „ T Jaqiiinu ^ „ 

uaa q a ABt] w suo T 6aa aso„ -atctaaxa ^ . p a„, TB ST SVNQ 
o„ at,, w, TBU6TS JO 0 „„ s „, ^ ^ 6uTiM:i " 
P3TJTW.pt A„= T nb a q UKJ Adoo P as B a,=ap ao pa.a.Lx 

« a« t,= T t,« ,„. a aw, u T S uo t b.;, T-mosomo^o . su „ n 

a«os X, paonpaa s T A> T =*d*= uo T ,« TPT «, A , „a W M paAO«a;, 
„JT *"* PU * «»»»»" «B M^oq UT saouanb., 

,W,dM •«« » M «'J» * JO .roo-o-oap a„t,da„« 

o, n,rs „ x paz TPT3q x M usq , pae psTTBqII a IT „,um.mtp 
a« s PT = B a T°T3nu o„, ^ .,.„., Jonn:) ,.j _ 

* ° T -» 6 »• - T « JU a„a t d BT auo UI .,ua m aLa.«aa 

a«oua6 JO aATMadsa^T aasaa^ap pu* „„.„ UT mJmMJI 

TnatuoTXTBH -aa S . S a 3 ua„bas P a,a t ap ac pa TJTt d ttB JO u „lJ ox 
aq, AJT , UBpT p UI aouasaad a„, „a,ap o, q3B oadd s ,ua:>aa 
•*» « ST (hsd) uoT»azTp T j qA t, oxmouab .A T ,s„d«03 

~ JO SUOTD „ „ adsns JBTn=T „ Ed uo uo „™ii™ii: 

JT 'ua«,T=ads „= BB ,„ awouafi aa T , UB at,, / 3uns „, 

ST ^<ll, Ba ° ' S3q " d TBnpTA TP"T AUa M a«a XoTd^a o, A«ss B =au 
lo L, -P' 2 *T*ua a q o, a=uanbas at,, „, OTJT38[is aqMd . 

JT uaAaT a " UTmTT3 °' » os P^u«„„ A TT Aaaw s T a.oua 6 a ql 
»NQ pa« T T r*" S "^Tuwoa, pa„ t a, pua 6uT „ OIq 

A.td- (5u T „ OTq u« wos spou„„a S a«x -sanbru^a, 

« T n=,a T o n bu TS n a T dmas . ut aouanbas p T =a =Ta T =nu uaATb 
» JO aq, bu TBS ass B „ M ad ^ 3U a, BJ a,c„ 

•pa T JTTdma „ E saouanbas JO uo T „= t3 t 1u »pt o, 

sa,n qT „uo= w uot« MOJ ut Xu B ap ? AO,d 3 ou op -sa M osol^= 
a,„u Ttt a Tqnop ao • (aSH , SUOT6 „ 6uTUTejs ATSno a U a 6oMoq 
uoT«= TJTT d« Bra6 JO S9 „« U6TS 3T „ ua6o:iA:) . UOT5TppB 
S3 " oso »«<.= a sey da 3 a« a«nbap B 6uT u T „ qo UT S a T „n=T„Tp 
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•saxoads aures aq^ 

«oaj eaE saqoad aq* pup s^uamaia *abaB} U T PT=« oxa T onu 

aq} AiiBnsn -msxuBbao Aub raoa? paAxaap aq Aam spxoa oxaionu 

am -VNQO ' YNQ ' VNH 'aidmaxa ao 3 'aq Aam saqoad aq* SE 
pup s^uamaia ^efiaB^ aq* q*oq ao3 spxoa oxaionu aqj, 

•spxoa oxaionu paiaqai 30 suox^oaiioo puooas 
P ub wcj aq* 30 A*xxaidmoo aouanbas aq* uaq* ssai qonm aaa 
*uama T a *abaa* au* ux spxoa oxa T onu *abaa* aq* 3© A*xxa T dmoo 
aouanbas aqx**uasaad aq Asm *uamaia *af>aa* « «T aouanbas qoaa 
jo saxdoo aaoui jo auo '*aoddns pttos b o* punoq saxnoaTom 
pxoa oxaionu *abaa* sasxadmoo *uamaia *abaB* qo*3 

•Anaxaas jo Aisnoaua*inmxs aaq*xa 
s*uame T a *abaa* aq* o* pa*oB*uoo aq uao saqoad aqj, -anooo ubo 
s*uama T a *abaa* aq* °* uoi*azxpxaqAq pxoa oxaionu *aq* uons 
suox*xpuoo ^P un s*uamaia *abaa* 30 A*x T Banid b o* pa*os*uoo 
aaa pamaoi snq* saqoad aqj, -aaq*o qoaa moai aiqBqsxnbux*sxp 
aq pinoqs siaqai puooas pua *sax3 aqi • A T aAX*oadsaa 'siaqai 
puooas pub *sax 5 W« uox*oanoo puooas aq* ux sainoaiom 
pxoa oxaionu aq* pub uox*oanoo *sax 3 aq} ux sainoaiom pxoa 
oxaionu aq* buxiaqai sasxadmoo poq*am aqi -uox*oanoo puooas 
a ux saouanbas anas asoq* 30 saaqmnu Adoo aq* 0* aAX*Biaa 
sainoaiom pxoa oxaionu 30 uox*oa T ioo *sa T 3 b ux saouanbas pxoa 
oxaionu o** *saai *b 30 saaqmnu Adoo buxaadmoo AiaAX*B*x*UBnb 

aoj spoq*am sapxAoad uox*uaAUX *uasaad aqj, ST 
NOIXN3ANI 3Hi .10 XHVWWI1S 

•s*X3auaq aaq*o pub asaq* sapxAoad uox*uaAU? *uasaad 
aqx -asaasxp 30 sxsoubaxp aq* ^03 aiqaaxsap AiaBinox*aad 
aaa uoxssaadxa auab 30 s T eAai ux saouaaa33TP UBO Wl* 

pua sax*xiemaouqa ipmosomoaqo 30 uox*bzxibooi asxoaad aaom 
'A*XAX*xsuas pasaaaoux apxAoad qoxqn sanbxuqoa* waN 

•qw OT-S 

*noqa Anansn sx uox*mosaa HDD au* 'aouanbas a 30 Adoo auo 30 
uxab ao ssoi aq* 0* pa*xmxi sx aaqmnu Adoo ux asaaaoux ub ao 
asaaaoap a aaauj* -paaxnbaa sx uox^azxpxaqAq aibuxs b Aiuo pua 
paaxnbaa aaB saouanbas 0x3x0**5 ao 3 saqoad o N -saouanbas aq* 
30 abpai«oux aoxad ^noq^xw aaqmnu Adoo ^uaxaBA q^xn saouanbas 
aq^ 30 uoxq.BOOi aqo. sdaui pub saaAOOSxp HOD 'snqi 
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* .) unou, st SMtno at<1 „ SA „ BTBJ Mojd w souanb9s 

VHC =T«oua 6 st a q o,d aouaaajaa ^ „ * 

«R jo s,=ad S B aawo JOJ pjEpuB: , s . „ _ P 

P — tp ,0 avlms . Bq UK , ABqi ao , stTB3 P : se ^;_3 

TW~ aq „„ BTTBO aouaaaja* . an „ T3 JQ , UOT „ Tndod 

, m J " "° Uaa "" ' P»«d«d st sp T =. o T a T =r,u 

•uoT» Tr Vd U ° Ta3BTt ° 3 "™ "» ~*m „ 

aqo'dl ' PMB "" d s * =x BT ,„u 

•qoad ,0 uo T »o BTTO= BUO ^ua^poqma TBo T dA, B UI 

•emouaB b u T saouanbas eAT W ade:i-uou 
au, jo srsXtaua aq, uo „snooj uoT,uaAUT aq, 'shut. 
•-<«« «n q, T » pBxto 8q „„ (¥Na ^ ,. £ . e) spTo / D ' TB ^ 



UT B aTT paTet,sTun ' sp ™ DTBi=nu °« =rs BW r:™ 



,»osa:,o„ T3 ATTBnBn m STBqBT b T qt 

•"o„« TPTJqAll mjs ar roUB=BBMnT3 Mj BTqB , Tns 3 „ n \\ u ^ 
6UTsn ,no pa-r« B o attbotcIa, B « sp oq,a„ BUI 

♦saqoad ax^ ut saouanbas 
SU " aJJ TP jo eo T , M aaqwnu Ado= jo uo S T JB d«oo s, Tm «d 

w« o, pu T q „q, SBqoJd OJK| 3qi SB=UBnbas * 

woo BW „, BM6 an wamsjs B D3 6UTPu * Bq "v:^ u 
JBS ::t::: r e=TdXi • 3ub ° 9ts ---J-™ 

s,ua rat a , B 6«, ,q, „, SBqMd Bql JBajv 

-„ •paAesse aq o, st uoT,dTJDsuM3 

1»T* U ox„T=ossa aso fl „ uo T6 aa t«o««ap b jo aAT, E , UBSBJdB P 
o, ^ 0 ' pa,MT8s B£ < 'asaesTp w „ p B , BTOOSSB Bq 

Jo^„L° S TWOT ™° SOB " l, = ^ — T^T,o A Bin ^ua.axa „a 

,0 »»^ t « -a T d„B X a ao, -pa^Tsap st uot^b^ojut .ac^u 

ad! " U ,a6 "* " B '° PT= 8 =Ta T onu a Va aptrs 

'"Codd^ J " " " ' spsaq Jo A„ TB an t d b sb w=ns 

s«oddns 81 B«das uo aq abu, s M a K a T a „ 6j b 3 awi 
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hty-Wut asuditoD os T b uw sptob oT3 T onu *abaB* am • (YNCIO 
T8TW ,d ao TT nj b a T duiBxa ao3) a^aidmoo aq lou Aw ao Abui 
upxq* '^xun ox^auab iBuox*oun3 * °* puodsaaaoo ao ' (AaBaqxi 
oxuiouab b \110a3 sauo T o snonbx^uoo T BaaAas ao auo T o * a T duiBX3 
ao 3 ) amouab au^ 30 uoxbaa pauxiap b ux uxfixao axam babu. 
AuboxcIA:* ^uamaia *abaB} b 30 ..spxoB oxaionu ^abaBj,., 

•pazxpxaqAu. bjb spxoB oxaionu 
aqoad uoxuw o* ao^jans pttos b uo pazxixqomuix sainoaiom 
P xob oxa T onu *af>W a«« ao auo buxsxadmoo upaa 's^uamaia 

*abaB} 30 A^xiBanid b sx „ABaaB pxoB oxaionu,, Y 

SUoT^tUliSa 

•axTT P u * samouab aouaaajaa a^Buaa^iB 
'sauiouab *sa:* aUA bux T aqBi J03 s^uabeaa 'sauioauooaoms 
^uaaajjxp oa^ asxadmoo aaqq.an? Abui ^x* ayx *aXTT 
am pub 'adA* nao aouaaajaa b moaj YNQO *o 'amouab aouaaajaa 
Tcrnaou b buxviasaadaa s P xob oxaionu fiuxuxs^uoo aauxa^uoo 
b pub o^8»m Punoq spxoB oxaionu ^a6ae^ 30 AsaaB ub buxABU. 
^aoddns pttos b asxadmoo uox^uaAUX ay* 30 s*tx -uox^uaAUX 
ai» 30 spomra am ano buxAaaBo ao 5 m 5 asn siBxaa*«m 
buxsxadmoo s*x* sapxAoad osib uox^uaAUX am 

•uxBq.uoo Aatn spxoB oxaionu 
•i» 30 sox^sxaa^oBasyo aau^o A T dtux *ou op Aat^ *nq 'saqoad 
au.} usxnbux^sxp o* aouaxuaAUOo J03 pasn aaB aouaaasa:* 
pus }sa} am TBaauab ux snm •uoxwviaaBjjTp anssx} 

30 aan^Ba3 iBox^xao b sx }Bi« uoxssaadxa auab TBX*uaaa33TP 
30 uox*oa W s^xmaad sanssx* ^uaaa 3 3TP 3° s TT ao iBnraou uae* W 
suoT^Bindod YNHBi 30 uosxaBdmoo aiduiBxa aoj -snao iBmaou 
moa3 paaBdaad aq Abui saqoad u.*oq 'aau^anj -sxsB^SB^em s*x 
P ub aornn^ AaBmxad ay? uaa^aq saouaaa33TP •»» s * ou - s uosxaBdoioo 
am *bu.} os 'aomn* *bu* tuoa3 s TT ao ox W SB*aui 30 YNd oxmouab 
moa3 paaBdaad Abui aqoad :*sa* aq* pub 'saouanbas *uaaa33TP 
s^x buoure aaqmnu Adoo ux suox^BxaBA iBX^uB^sqns uxb^uoo 
Abui upxq« aoum* AaBmxad b moa 5 YNQ oxmouab uroa3 paaBdaad aq 
Abui aqoad aouaaa3aa am A T aAX*Buaa:nY 'IBmaou moa3 suox^BxaBA 
ux uox^oaw s^xmaad aqoad ^sa^ b o* sxq^ 30 uosxaBdmoo 
• (aapuaB uo buxpuadap aouanbas T BUiosouioaqo xas qoBa 30 
saxdoo ao auo P ub 'aouanbas iBmoso^nB qoBa 30 saxdoo o«^ 
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}uama T a *afiaB* UOBa azxufiooaa TTT « T TT*s 50 auo 

•^TT BXK pub 'woddns p TT os at,* < T aqB T ^ ?0 aanw 
au* sb uDns ' S « W3 jo aaqmnu b uodn puadap TTTft sax^xsuap 
*uama T a ,afiaB* aiu -sax^xsuap *uaaa 3 jxp * B aoBjans px T os au, 
uo pafiu H « aq Aem sABaaa au* jo s*uama T a *afiaB* au x 



-mure 

^noq B P ub mr/ s W qa uaa^aq Axqwajaad 'nrm C ^noq B nrix max; 
MB SSZTS * U *»*T* *TT*aaua 0 -:ra*a«B T p ux mo T W q B ubi* ssa T 
aq TTT« *ua«a T a B 'AXTsoxdAi -paaaajaad a» W a«a T a 

^a£«^ '«nw«s '/TTHaauao -uox^uaAUx «n Jo sABoaB au* u T 
pasn aq uw suoxsuamxp snoxaaA jo s*uama T a ^afiaBj 

. *sauafi asaq^. 30 

Apn*s uo uopu. W . fiuxsrooj Aqaaau* < W aa*ux go s TT ao amos ux 
passaadxa ATX*T*uaaai IT p aq ,ufi T m ^ asoi ^ , e uox W xxx:*uapx 
;TSU8d Plnort sau °T 3 VNQo snomAuouB bojj s P tob oxa T onu 
fiuxsxadmoo s*uama T a s*a£, w JO A MJB U b 'ATaB T xmx S 

♦amouafi uBumu at« 10 afisaaAoo 
a*a T dmoo batB p T noo ynq oxmouafi 30 q3 «)0T 6uxuxb W o uoaa 
s*uama T a ooo'OG jo / W « uy .. T .a» W asBqBfiam'oT :>b amouafi 
UB«mu aa W a hr a T doi BS PT noo 'auo T o oxmouafi ^uaaajjxp b 
«H°aj YNa fiuxuxB^uoo uoaa <s*uama T a *afiaB:> 00C jo Abjjb 

ub a T dmBxa a OJ .fiu T Td«». ,0 A^xsuap auxmaa^ap PT no« uoaa 

UT spxoB Dxaxonu au* jo A:>xxa T dmoo at* p UB s * U auia T a *b£>^ j Q 
aaqmnu a UJ . . aiuosomoauo b JO uo TW od b ao ' amosomoauo a T fiuxs 
* 'amouafi MT ,ue u, 'o^ pa*xmx T *ou *nq <fiuxpn T oux 'amouafi 
■ jo uoTwd paaxsap aub 's^uxod a*aaosxp *b ao A TS nonut,uoo 
'aidures suox*bdo T *uasaadaa pxnoo samara 

uons jo A»» uy -uoxwot ao aouBoxjxufixs unouvuri 30 spxoB 
ojaionu asxadmoo Am ^uama T a t^afiaB* b 'AT*AT*m«. w 
•»T«a»T Tiuouq, ,» paqxaosuBa:* fiuxaq 3 o pa^oadsns 'VNHur uons 
«oaj paAxaap ynqo ao < Y N*m u B ux W oo os T , abut ^uama T a ,afia B ^ 
«tt 'STiao aoum, -£- 9 ^ S3 « W t ? o s TT ao ux aaqmnu Adoo 
pasBaaoap ao pasBaaoux ^b ^uasaad fiuxaq JO pa^oadsns uoxfiaa 
T^osomoayo b moaj aq 'ao sauafi oxjxoads uxb^uoo <a T dmBxa'aoi 
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may comprise a mixture of target nucleic acids of different 
lengths and sequences. Thus, for example, a target element 
may contain more than one copy of a cloned piece of DNA, and 
each copy may be broken into fragments of different lengths. 
The length and complexity of the target sequences of the 
invention is not critical to the invention. One of skill can 
adjust these factors to provide optimum hybridization and 
signal production for a given hybridization procedure, and to 
provide the required resolution among different genes or 
genomic locations. Typically, the target sequences will have 
a complexity between about 1 kb and about 1 Mb. 

In preferred embodiments, the targets of the 
invention are nucleic acids which substantially lack 
superstructure associated with condensed metaphase chromosomes 
from which they are derived. The general nature of the 
packing of DNA into eukaryotic chromosomes is well known to 
those of skill in the art. Briefly, the superstructure of a 
eukaryotic chromosome comprises many orders of complexity. 
DNA is wrapped around a histone core to form regular repeating 
nucleosomes, which, in turn, are packed one upon another to 
generate more tightly condensed 3 onm chromatin fibers. The 
chromatin fibers are then further packed in a variety of 
looped domains to produce higher orders of folding and 
condensation in the metaphase chromosome. The nucleic acid 
targets of the invention lack some or all of the these 
features of naturally occurring condensed, metaphase 
chromosomes. For a general description of global structure of 
eukaryotic chromosomes, see, Alberts et al. Molecular Biology 
of the Cell 2nd ed. pp 496-506, Garland Publishing inc. New 
York, 1989) . 

The terras "nucleic acid" or "nucleic acid molecule" 
refer to a deoxyribonucleotide or ribonucleotide polymer in 
either single- or double-stranded form, and unless otherwise 
limited, would encompass known analogs of natural nucleotides 
that can function in a similar manner as naturally occurring 
nucleotides. 

As used herein a "probe" is defined as a collection 
of nucleic acid molecules (either RNA or DNA) capable of 
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binding to a target nucleic acid of complementary sequence 
through one or more types of chemical bonds, usually through 
hydrogen bond formation. The probes are preferably directly 
or indirectly labelled as described below. They are typically 
5 of high complexity, for instance, being prepared from total 
genomic DNA or mRNA isolated from a cell or cell population. 

The term "complexity" is used here according to 
standard meaning of this term as established by Britten et al. 
Methods of Enzymol . 29:363 (1974). See, also Cantor and 

10 Schimmel Biophysical Chemistry: Part III at 1228-1230 for 
further explanation of nucleic acid complexity. 

"Bind(s) substantially" refers to complementary 
hybridization between a probe nucleic acid and a target 
nucleic acid and embraces minor mismatches that can be 

15 accommodated by reducing the stringency of the hybridization 
media to achieve the desired detection of the target 
polynucleotide sequence. 

The terms "specific hybridization" or "specifically 
hybridizes with" refers to hybridization in which a probe 

2 0 nucleic acid binds substantially to target nucleic acid and 
does not bind substantially to other nucleic acids in the 
array under defined stringency conditions. One of skill will 
recognize that relaxing the stringency of the hybridizing 
conditions will allow sequence mismatches to be tolerated. 

25 The degree of mismatch tolerated can be controlled by suitable 
adjustment of the hybridization conditions. 

One of skill will also recognize that the precise 
sequence of the particular nucleic acids described herein can 
be modified to a certain degree to produce probes or targets 

30 that are "substantially identical" to others, and retain the 

ability to bind substantially to a complementary nucleic acid. 
Such modifications are specifically covered by reference to 
individual sequences herein. The term "substantial identity" 
of polynucleotide sequences means that a polynucleotide 

35 comprises a sequence that has at least 90% sequence identity, 
and more preferably at least 95%, compared to a reference 
sequence using the methods described below using standard 
parameters • 
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Two nucleic acid sequences are said to be 
"identical" if the sequence of nucleotides in the two 
sequences is the same when aligned for maximum correspondence 
as described below. The term "complementary to" is used 
herein to mean that the complementary sequence is 
complementary to all or a portion of a reference 
polynucleotide sequence* 

Sequence comparisons between two (or more) 
polynucleotides are typically performed by comparing sequences 
of the two sequences over a "comparison window" to identify 
and compare local regions of sequence similarity. A 
"comparison window", as used herein, refers to a segment of at 
least about 20 contiguous positions, usually about 50 to about 
200, more usually about 100 to about 150 in which a sequence 
may be compared to a reference sequence of the same number of 
contiguous positions after the two sequences are optimally 
aligned* 

Optimal alignment of sequences for comparison may be 
conducted by the local homology algorithm of Smith and 
Waterman Adv. Appl . Math. 2: 482 (1981), by the homology 
alignment algorithm of Needleman and Wunsch J . Mol . fliol . 
48:443 (1970), by the search for similarity method of Pearson 
and Lipman Proc. Natl. Acad. Sci. (U.S.A.) 85: 2444 (1988), by 
computerized implementations of these algorithms. 

"Percentage of sequence identity" is determined by 
comparing two optimally aligned sequences over a comparison 
window, wherein the portion of the polynucleotide sequence in 
the comparison window may comprise additions or deletions 
(i.e., gaps) as compared to the reference sequence (which does 
not comprise additions or deletions) for optimal alignment of 
the two sequences. The percentage is calculated by 
determining the number of positions at which the identical 
nucleic acid base occurs in both sequences to yield the number 
of matched positions, dividing the number of matched positions 
by the total number of positions in the window of comparison 
and multiplying the result by 100 to yield the percentage of 
sequence identity. 
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Another indication that nucleotide sequences are 
substantially identical is if two molecules hybridize to the 
same sequence under stringent conditions. Stringent 
conditions are sequence dependent and will be different in 
5 different circumstances- Generally, stringent conditions are 
selected to be about 5° C lower than the thermal melting point 
(Tm) for the specific sequence at a defined ionic strength and 
pH. The Tm is the temperature (under defined ionic strength 
and pH) at which 50% of the target sequence hybridizes to a 
10 perfectly matched probe. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows photomicrographs of experiments 
showing the ability of the methods of the invention to detect 

15 an amplification of the cMYC oncogene. Labelled Colo-320 DNA, 
which contains an amplification of the cMYC oncogene, and 
labelled normal human DNA were hybridized to an array 
consisting of two target elements. One target element 
contained cloned cMYC oncogene sequences, and the other 

20 contained cloned sequences from a region of the human genome 
(21D7) known to be unamplified in the Colo-320 cell line. 
Each target element comprises single stranded fragments 
corresponding to a clone. The fragments were immobilized on 
avidin coated glass particles. 

25 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention provides methods for comparing 
abnormal nucleic acid copy number and mapping of chromosomal 
abnormalities associated with disease. The methods of the 
3 0 invention use target nucleic acids immobilized on a solid 

support, to which differentially labeled probe nucleic acids 
are hybridized. The hybridization of the labeled nucleic 
acids to the target is then detected using standard 
techniques • 

3 5 The methods of the invention compare the copy 

numbers of sequences capable of binding to the target 
elements. Variations in copy number detectable by the methods 
of the invention may arise in different ways. For example, 
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copy number may be altered as a result of amplification or 
deletion of a chromosomal region. Alternatively, copy number 
may be reduced by genetic rearrangements that alter the 
sequences in the probe or target nucleic acids sufficiently to 
reduce their binding. 

Target Nucleic Acids 

Target nucleic acids of the invention can be derived 
from virtually any source. Typically, the targets will be 
nucleic aid molecules derived from representative locations 
along a chromosome of interest, a chromosomal region of 
interest, an entire genome of interest, a cDNA library, and 
the like. These target nucleic acids may be relatively long 
(typically thousands of bases) fragments of nucleic acid 
obtained from, for instance, inter-AIu PGR products of genomic 
clones, restriction digests of genomic clone, cDNA clones and 
the like. In some embodiments the target nucleic acids are a 
previously mapped library of clones spanning a particular 
region of interest. 

The choice of target nucleic acids to use may be 
influenced by prior knowledge of the association of a 
particular chromosome or chromosomal region with certain 
disease conditions. International Application wo 93/18186, 
supra, provides a list of chromosomal abnormalities and 
associated diseases, which are described in the scientific 
literature. Alternatively, whole genome screening to identify 
new region subject to frequent changes in copy number can be 
performed using the methods of the present invention. In 
these embodiments, target elements usually contain nucleic 
acids representative of locations distributed over the entire 
genome. In some embodiments (e.g., using a large number of 
target elements of high complexity) all sequences in the 
genome can be present in the array. 

In some embodiments, previously mapped clones from a 
particular chromosomal region of interest are used as targets. 
Such clones are becoming available as a result of rapid 
progress of the worldwide initiative in genomics. 
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Mapped clones can be prepared from libraries 
constructed from single chromosomes, multiple chromosomes , or 
from a segment of a chromosome. Standard techniques are used 
to clone suitably sized fragments in vectors such as cosmids, 
5 yeast artificial chromosomes (YACs) , bacterial artificial 
chromosomes (BACs) and PI phage. 

While it is possible to generate clone libraries, as 
described above, libraries spanning entire chromosomes are 
also available commercially* For instance , chromosome* 
10 specific libraries from the human and other genomes are 

available for Clonetech (South San Francisco, CA) or from The 
American Type Culture Collection (see, ATCC/NIH Repository of 
Catalogue of Human and Mouse DNA Probes and Libraries, 7th ed. 
1993) . 

15 If necessary, clones described above may be 

genetically or physically mapped. For instance, FISH and 
digital image analysis can be used to localize cosmids along 
the desired chromosome. This method is described, for 
instance, in Lichter et al . , Science , 247:64*69 (1990). The 

2 0 physically mapped clones can then be used to more finally map 

a region of interest identified using CGH or other methods. 

Attachment of target nucleic acids to a solid surface 

Many methods for immobilizing nucleic acids on a 
25 variety of solid surfaces are known in the art. For instance, 
the solid surf ace may be a membrane, glass, plastic, or a 
bead. The desired component may be covalently bound or 
noncovalently attached through nonspecific binding. The 
immobilization of nucleic acids on solid surfaces is discussed 

3 0 more fully below. 

A wide variety of organic and inorganic polymers, as 
well as other materials, both natural and synthetic, may be 
employed as the material for the solid surface. Illustrative 
solid surfaces include nitrocellulose, nylon, glass, 
3 5 diazotized membranes (paper or nylon) , silicones, 

polyf ormaldehyde, cellulose, and cellulose acetate. In 
addition, plastics such as polyethylene, polypropylene, 
polystyrene, and the like can be used. Other materials which 
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may be employed include paper, ceramics, metals, metalloids, 
semiconductive materials, cermets or the like. in addition 
substances that form gels can be used. Such materials include 
proteins (e.g., gelatins), lipopolysaccharides, silicates, 
agarose and polyacrylamides . Where the solid surface is 
porous, various pore sizes may be employed depending upon the 
nature of the system. 

In preparing the surface, a plurality of different 
materials may be employed, particularly as laminates, to 
obtain various properties. For example, proteins (e.g., 
bovine serum albumin) or mixtures of macromolecules (e.g., 
Denhardt's solution) can be employed to avoid non-specific 
binding, simplify covalent conjugation, enhance signal 
detection or the like. 

If covalent bonding between a compound and the 
surface is desired, the surface will usually be polyfunctional 
or be capable of being polyfunctionalized. Functional groups 
which may be present on the surface and used for linking can 
include carboxylic acids, aldehydes, amino groups, cyano 
groups, ethylenic groups, hydroxy 1 groups, mercapto groups and 
the like. The manner of linking a wide variety of compounds 
to various surfaces is well known and is amply illustrated in 
the literature. For example, methods for immobilizing nucleic 
acids by introduction of various functional groups to the 
molecules is known (see, e.g., Bischoff et al., Anal. Biochem. 
164:336-344 (1987); Kremsky et al., Nuc. Acids Res. 
15:2891-2910 (1987)). Modified nucleotides can be placed on 
the target using PCR primers containing the modified 
nucleotide, or by enzymatic end labeling with modified 
nucleotides. 

Use of membrane supports (e.g., nitrocellulose, 
nylon, polypropylene) for the nucleic acid arrays of the 
invention is advantageous because of well developed technology 
employing manual and robotic methods of arraying targets at 
relatively high element densities (e.g., up to 30-40/cm 2 ) . In 
addition, such membranes are generally available and protocols 
and equipment for hybridization to membranes is well known. 
Many membrane materials, however, have considerable 
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fluorescence emission, where fluorescent labels are used to 
detect hybridization. 

To optimize a given assay format one of skill can 
determine sensitivity of. fluorescence detection for different 
combinations of membrane type, f luorochrome f excitation and 
emission bands, spot size and the like. In addition, low 
fluorescence background membranes have been described (see, 
e.g., Chu et a!., Electrophoresis 13:105-114 (1992)). 

The sensitivity for detection of spots of various 
diameters on the candidate membranes can be readily determined 
by, for example, spotting a dilution series of f luorescently 
end labeled DNA fragments. These spots are then imaged using 
conventional fluorescence microscopy. The sensitivity, 
linearity , and dynamic range achievable from the various 
combinations of fluorochrome and membranes can thus be 
determined. Serial dilutions of pairs of fluorochrome in 
known relative proportions can also be analyzed to determine 
the accuracy with which fluorescence ratio measurements 
reflect actual fluorochrome ratios over the dynamic range 
permitted by the detectors and membrane fluorescence. 

Arrays on substrates with much lower fluorescence 
than membranes, such as glass, quartz, or small beads, can 
achieve much better sensitivity. For example, elements of 
various sizes, ranging from the -lmm diameter down to -l/xm can 
be used with these materials. Small array members containing 
small amounts of concentrated target DNA are conveniently used 
for high complexity comparative hybridizations since the total 
amount of probe available for binding to each element will be 
limited. Thus it is advantageous to have small array members 
that contain a small amount of concentrated target DMA so that 
the signal that is obtained is highly localized and bright. 
Such small array members are typically used in arrays with 
densities greater than 10 4 /cm 2 . Relatively simple approaches 
capable of quantitative fluorescent imaging of 1 cm 2 areas 
have been described that permit acquisition of data from a 
large number of members in a single image (see, e.g., Wittrup 
et. al. Cytometry 16:206-213 (1994)). 
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Covalent attachment of the target nucleic acids to 
glass or synthetic fused silica can be accomplished according 
to a number of known techniques. Such substrates provide a 
very low fluorescence substrate, and a highly efficient 
hybridization environment. 

There are many possible approaches to coupling 
nucleic acids to glass that employ commercially available 
reagents. For instance, materials for preparation of 
silanized glass with a number of functional groups are 
commercially available or can be prepared using standard 
techniques. Alternatively, quartz cover slips, which have at 
least 10-fold lower auto fluorescence than glass, can be 
silanized. 

The targets can also be immobilized on commercially 
available coated beads or other surfaces. For instance, 
biotin end-labelled nucleic acids can be bound to commercially 
available avidin-coated beads. Streptavidin or anti- 
digoxigenin antibody can also be attached to silanized glass 
slides by protein-mediated coupling using e.g., protein A 
following standard protocols (see, e.g., Smith et al. Science, 
258:1122-1126 (1992)). Biotin or digoxigenin end-labeled 
nucleic acids can be prepared according to standard 
techniques . 

Hybridization to nucleic acids attached to beads is 
accomplished by suspending them in the hybridization mix, and 
then depositing them on the glass substrate for analysis after 
washing. Alternatively, paramagnetic particles, such as 
ferric oxide particles, with or without avidin coating, can be 
used. 

The prior art also describes techniques capable of 
producing high density arrays for various applications 
including sequencing by hybridization and detection of 
particular sequences (see, e.g., Fodor et al. Science 767-773 
(1991) and U.S. Patent No. 5,143,854). 

Preparation of probe nucleic acids 

As with target nucleic acids, a wide variety of 
nucleic acids can be used as probe nucleic acids in the 
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methods of the present invention- The probes may be 
comprise, for example, genomic DNA representing the entire 
genome from a particular organism, tissue or cell type or may 
comprise a portion of the genome, such as a single chromosome. 

5 

To compare expression levels of a particular gene or 
genes, the probes nucleic acids can be derived from mRNA or 
cDNA prepared from an organism, tissue, or cell of interest. 
For instance, test cDNA or mRNA, along with mRNA or cDNA from 

10 normal reference cells, can be hybridized to an array of 

clones from a normalized cDNA library. In addition, probes 
made from genomic DNA from two cell populations can be 
hybridized to a cDNA array to detect those cDNAs that come 
from regions of variant DNA copy number in the genome. 

15 The methods of the invention are suitable for 

comparing copy number of particular sequences in any 
combination of two or more populations of nucleic acids. One 
of skill will recognize that the particular populations of 
sample nucleic acids being compared is not critical to the 

20 invention. For instance, genomic or cDNA can be compared from 
two related species. Alternatively, levels of expression of 
particular genes in two or more tissue or cell types can be 
compared. As noted above, the methods are particularly useful 
in the diagnosis of disease. 

2 5 Standard procedures can be used to isolate nucleic 

acids (either DNA or mRNA) from appropriate tissues (see, 
e.g., Sambrook, et al.. Molecular Cloning— A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
N* Y. (1985)). Conventional methods for preparation of cDNA 

30 from mRNA can also be used. 

The particular cells or tissue from which the 
nucleic acids are isolated will depend upon the particular 
application. Typically, for detection of abnormalities 
associated with cancer, genomic DNA is isolated from tumor 

35 cells. For prenatal detection of disease, fetal tissue will 
be used. 

If the tissue sample is small, so that a small 
amount of nucleic acids is available, amplification techniques 
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such as the polymerase chain reaction (PCR) using degenerate 
primers can be used. For a general description of PCR, see, 
PCR Protocols, Innis et al . eds. Academic Press, 1990. in 
addition, PCR can be used to selectively amplify sequences 
between high copy repetitive sequences* These methods use 
primers complementary to highly repetitive interspersed 
sequences (e.g., Alu) to selectively amplify sequences that 
are between two members of the Alu family (see, Nelson et al., 
Proc. Natl. Acad. Sci. USA 86:6686 (1989)). 

As noted above, CGH at the cytogenetic level is 
facilitating the search for disease genes by identifying 
regions of differences in copy number between a normal and 
tumor genome, for example. For instance, CGH studies have 
been applied to the analysis of copy number variation in 
breast cancer (see, e.g., Kallioniemi et al . Proc. Natl. Acad. 
Sci. USA 91:2156-2160 (1994)). 

In CGH, the resolution with which a copy number 
change can be mapped is on the order of several megabases. 
With the present invention the resolution is a function of the 
length of the genomic DNA segments in the target elements and 
the difference in map position between neighboring clones. 
Resolution of more than a factor of 10 better than with 
standard CGH can be achieved with the present invention. This 
improved localization will facilitate efforts to identify the 
critical genes involved in a disease, and permit more 
sensitive detection of abnormalities involving a small region 
of the genome, such as in microdeletion syndromes. 

Labelling nucleic acid nrob f s 

As noted above, the nucleic acids which are 
hybridized to the target nucleic acids are preferably labelled 
to allow detection of hybridization complexes. The nucleic 
acid probes used in the hybridization described below may be 
detectably labeled prior to the hybridization reaction. 
Alternatively, a detectable label may be selected which binds 
to the hybridization product. As noted above, the target 
nucleic acid array is hybridized to two or more probe nucleic 
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acids, either simultaneously or serially. Thus, the probes 
are each labeled with a separate and distinguishable label. 

The particular label or detectable group attached to 
the probe nucleic acids is not a critical aspect of the 
5 invention, so long as it does not significantly interfere with 
the hybridization of the probe to the target sequence. The 
detectable group can be any material having a detectable 
physical or chemical property. Such detectable labels have 
been well-developed in the field of nucleic acid 

10 hybridizations and in general most any label useful in such 

methods can be applied to the present invention. Thus a label 
is any composition detectable by spectroscopic, photochemical, 
biochemical, immunochemical, electrical, optical or chemical 
means. Useful labels in the present invention include 

15 fluorescent dyes (e.g., fluorescein isothiocyanate, texas red, 
rhodamine, and the like) radiolabels (e.g., 3 H, 12S I, 35 S, 14 C, 
or 32 P) , enzymes (e.g., horse radish peroxidase, alkaline 
phosphatase and others commonly used in an ELISA) . 

The nucleic acids can be indirectly labeled using 

20 ligands for which detectable anti-ligands are available. For 
example, biotinylated nucleic acids can be detected using 
labeled avidin or streptavidin according to techniques well 
known in the art. In addition, antigenic or haptenic 
molecules can be detected using labeled antisera or monoclonal 

25 antibodies. For example, N-acetoxy-N-2-acetylaminof luorene- 
labelled or digoxigenin-labelled probes can be detected using 
antibodies specifically immunoreactive with these compounds 
(e.g., FITC-labeled sheep anti-digoxigenin antibody 
(Boehringer Mannheim)). In addition, labeled antibodies to 

30 thymidine- thymidine dimers can be used (Nakane et al. ACTA 
Histoch&m. Cytochem. 20:229 (1987)). 

Generally, labels which are detectable in as low a 
copy number as possible, thereby maximizing the sensitivity of 
the assay, and yet be detectable above any background signal 

3 5 are preferred. A label is preferably chosen that provides a 
localized signal, thereby providing spatial resolution of the 
signal from each target element. 
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The labels may be coupled to the DNA in a variety of 
means known to those of skill in the art. In a preferred 
embodiment the probe will be labeled using nick translation or 
random primer extension (Rigby, et al . J. Mol. Biol. ,113: 237 
(1977) or Sambrook, et al., Molecular Cloning - A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
N.Y. (1985) ) . 

Hybridization of labeled nucleic acids to targets 

The copy number of particular nucleic acid sequences 
in two probes are compared by hybridizing the probes to one or 
more target nucleic acid arrays. The hybridization signal 
intensity, and the ratio of intensities, produced by the 
probes on each of the target elements is determined. 
Typically the greater the ratio of the signal intensities on a 
target element the greater the copy number ratio of sequences 
in the two probes that bind to that element. Thus comparison 
of the signal intensity ratios among target elements permits 
comparison of copy number ratios of different sequences in the 
probes • 

Standard hybridization techniques are used to probe 
a target nucleic acid array. Suitable methods are described 
in references describing CGH techniques (Kallioniemi et al . , 
Science 258: 818-821 (1992) and WO 93/18186). Several guides 
to general techniques are available, e.g., Tijssen, 
Hybridization with Nucleic Acid Probes, Parts I and II 
(Elsevier, Amsterdam 1993) . For a descriptions of techniques 
suitable for in situ hybridizations see, Gall et al. Math. 
Enzymol., 21:470-480 (1981) and Angerer et al. in Genetic 
Engineering: Principles and Methods Setlow and Hollaender, 
Eds. Vol 7, pgs 43-65 (plenum Press, New York 1985) . 

Generally, nucleic acid hybridizations comprise the 
following major steps: (1) immobilization of target nucleic 
acids; (2) prehybridization treatment to increase 
accessibility of target DNA, and to reduce nonspecific 
binding; (3) hybridization of the mixture of nucleic acids to 
the nucleic acid on the solid surface; (4) posthybridization 
washes to remove nucleic acid fragments not bound in the 
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hybridization and (5) detection of the hybridized nucleic acid 
fragments. The reagent used in each of these steps and their 
conditions for use vary depending on the particular 
application. 

In some applications it is necessary to block the 
hybridization capacity of repetitive sequences. a number of 
methods for removing and/or disabling the hybridization 
capacity of repetitive sequences are known (see, e.g., wo 
93/18186) . 

For instance, bulk procedures can be used. In many 
genomes, including the human genome, a major portion of shared 
repetitive DNA is contained within a few families of highly 
repeated sequences such as Alu. These methods exploit the 
fact that hybridization rate of complementary sequences 
increases as their concentration increases. Thus, repetitive 
sequences, which are generally present at high concentration 
will become double stranded more rapidly than others following 
denaturation and incubation under hybridization conditions. 
The double stranded nucleic acids are then removed and the 
remainder used in hybridizations. Methods of separating 
single from double stranded sequences include using 
hydroxyapatite or immobilized complementary nucleic acids 
attached to a solid support. Alternatively, the partially 
hybridized mixture can be used and the double stranded 
sequences will be unable to hybridize to the target. 

Alternatively, unlabeled sequences which are 
complementary to the sequences whose hybridization capacity is 
to be inhibited can be added to the hybridization mixture. 
This method can be used to inhibit hybridization of repetitive 
sequences as well as other sequences. For instance, "Cot-l" 
DNA can be used to selectively inhibit hybridization of 
repetitive sequences in a sample. To prepare Cot-l DNA, DNA 
is extracted, sheared, denatured and renatured to a C 0 t - 1 
(for description of reassociation kinetics and C 0 t values, 
see, Tijssen, supra at pp 48-54). Because highly repetitive 
sequences reanneal more quickly, the resulting hybrids are 
highly enriched for these sequences. The remaining single 
stranded (i.e., single copy sequences) is digested with SI 
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nuclease and the double stranded Cot-1 DNA is purified and 
used to block hybridization of repetitive sequences in a 
sample. Although Cot-l ONA can be prepared as described 
above, it is also commercially available (BRL) . 

Analysis of detectable signals from hybridizations 

Standard methods for detection and analysis of 
signals generated by labeled probes can be used* The 
particular methods will depend upon the labels used in the 
probes* Generally, fluorescent labels are preferred. Thus, 
methods suitable in fluorescence in situ hybridization (FISH) 
are suitable in the present invention. The nucleic acid 
arrays are imaged in a fluorescence microscope with a 
polychromatic beam-splitter to avoid color-dependent image 
shifts. The different color images are acquired with a CCD 
camera and the digitized images are stored in a computer- A 
computer program is then used to analyze the signals produced 
by the array. 

Preferred methods of visualizing signals are 
described in Kallioniemi et al . , supra and in WO 93/18186. 
To facilitate the display of results and to improve the 
sensitivity of detecting small differences in fluorescence 
intensity, a digital image analysis system is preferably used. 
A preferred system is QUIPS (an acronym for quantitative image 
processing system) , which is an automated image analysis 
system based on a standard fluorescence microscope equipped 
with an automated stage, focus control and filterwheel (Ludl 
Electronic Products, Ltd., Hawthorne, NY). The filterwheel is 
mounted in the fluorescence excitation path of the microscope 
for selection of the excitation wavelength. Special filters 
(Chroma Technology, Brattleboro, VT) in the dichroic block 
allow excitation of the multiple dyes without image 
registration shift. The microscope has two camera ports, one 
of which has an intensified CCD camera (Quantex Corp., 
Sunnyvale, CA) for sensitive high-speed video image display 
which is used for finding interesting areas on a slide as well 
as for focusing. The other camera port: has a cooled CCD 
camera (model 2 00 by Photometries Ltd. , Tucson, AZ) which is 
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used for the actual image acquisition at high resolution and 
sensitivity. 

The cooled CCD camera is interfaced to a SUN 4/330 
workstation (SUN Microsystems, Inc. , Mountain View, CA) 
5 through a VME bus. The entire acquisition of multicolor 
images is controlled using an image processing software 
package SCIL-Image (Delft Centre for Image Processing, Delft, 
Netherlands) . 



EXAMPLE 1 

This example demonstrates detection of the 
amplification of a specific sequence in a tumor cell line, 
15 Colo-320, which contains an amplification of the cMYC 
oncogene . 

One aliquot of Colo-3 20 DNA was labeled by nick 
translation with FITC-dUTP and a second with Texas red-dUTP 
nucleotides. Normal human DNA, was used as the reference 
20 genome. Two aliquots were labeled similarly to the test 
genome . 

The hybridization array consisted of two target 
elements. One contained cMYC oncogene sequences, and the 
other sequences from a region of the human genome (21D7) known 

25 to be unamplified in the Colo-320 cell line. DNA from Pi 
clones (insert length -80 kb) for these two loci (obtained 
from the LBL/UCSF Resource for Molecular Cytogenics) was 
isolated and cut to completion with the restriction enzyme 
Hindlll, resulting in fragments ranging in length from several 

30 hundred bp to over 10 kb. One base of the resulting overhang 
was filled using biotin-dATP, and the DNA was denatured. Thus 
each single stranded fragment was end labeled with a single 
biotin. The single stranded fragments corresponding to each 
clone were reacted with different aliquots of avidin coated 

35 controlled pore glass (CPG Inc.) "5 p.m particles" (very 

heterogeneous in size and shape) . Thus, one population of 
particles contained cMYC target sequences, and the other 
contained 21D7 sequences. Random priming labeling of the 
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single stranded DNA on the particles using FITC-dUTP showed 
that it was confined to the surface. These large fragments 
evidently did not substantially penetrate into the pores in 
the particles. 

Two comparative hybridizations were performed to 
control for potential artifacts due to the differential 
behavior of labeled probes, and the like. 

1) 300 ng of FITC-labeled Colo-320 genomic DNA and 
300 ng of Texas-red-labeled normal genomic DNA, and 10 tig of 
unlabeled C Q t-l DNA were dissolved in 20 /xl of hybridization 
mix to achieve final concentrations of 50% formamide, 2xSSC, 
and 10% dextran sulfate. This was heated to 70°C to denature 
the DNA, and 10 /il was added to a small number of particles 
containing cMYC sequences. The remaining 10 /il was similarly 
added to a small number of 21D7-containing particles. 

2) This hybridization was similar to the first 
except the fluorochrome labels were reversed. Thus Colo-320 
was labeled with Texas-red and normal genomic DNA with FITC. 
Hybridization proceeded for 3 6-48 hours at 37 °C and the 
particles were washed, suspended in fluorescence anti-fade, 
and mounted on a microscope slide. 

Particles were observed with a conventional 
fluorescence microscope. Hybridization signal was prominent 
on the surface of the particles (appearing as discrete 
fluorescence granules) . Quantitative CCD camera images of the 
individual fluorochrome in representative particles were 
acquired with a digital microscope system with the microscope 
focused near the equatorial planes of the particles. Images 
for particles selected to be 10-15 Mm in "diameter" are shown 
in Figure 1. Due to their size most of each particle was out 
of focus. The upper panel shows the results when the Colo-320 
DNA was labeled with FITC and the normal DNA with Texas red, 
while the lower panel shows the results when the labeling was 
reversed. Within each panel the upper row shows Texas red and 
the lower row shows FITC images. The two columns on the left 
show particles containing 21D7 target sequences, while the two 
on the right are particles with cMYC sequences. The exposure 
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for all of the images was 1 sec, and they are displayed 
without any contrast enhancement or background subtraction. 
The upper panel shows that the Texas red labeled 
normal genomic DNA yielded approximately equal intensities on 
5 the two different 21D7 particles and the two cMYG particles. 
However, the intensity of hybridization of the FITC-labeled 
Colo-32 0 DNA to the cMYC particles was substantially higher 
than to the 21D7 particles. This indicates the presence of 
more copies of cMYC than 21D7 sequences in the cell line since 

10 the ratio of Colo to normal signal on the cMYC particles is 
substantially higher than on the 21D7 particles. The FITC 
signal on the cMYC particles formed a ring at the edge of the 
particle f indicating predominant surface staining. 

The lower panel with reverse labelling shows that 

15 signal due to the FITC-labeled normal genomic DNA was 

approximately equal on all of the particles, while the Texas 
red-labeled Colo-32 0 DNA yielded a brighter signal on the cMYC 
particles. Thus the amplification detected was independent of 
the labeling scheme used. 

20 Quantitative determination of the fluorescence 

ratios was difficult for these particles because of their 
thickness and auto fluorescence. However rough estimates 
indicated that the ratio of the Colo to reference signal on 
the cMYC particles is more than three times (and perhaps 20 

25 times) larger than the ratio on the 21D7 particles. 

The above example is provided to illustrate the 
invention but not to limit its scope. Other variants of the 
invention will be readily apparent to one of ordinary skill in 
30 the art and are encompassed by the appended claims. All 

publications, patents, and patent applications cited herein 
are hereby incorporated by reference. 
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WHAT IS CLAIMED IS; 

1. A method for comparing copy number of nucleic 
acid sequences in a two or more collections of nucleic acid 
molecules, the method comprising: 

(a) providing a plurality of target elements bound to a 
solid surface, each target element comprising a target nucleic 
acid; 

(b) contacting the target elements with: 

(i) a first collection of labelled nucleic acid 
comprising a sequence substantially complementary to 
a target nucleotide sequence, and 

(ii) at least a second labelled nucleic acid 
comprising a sequence complementary to the target 
nucleotide sequence ; 

wherein the first and second labels are 
distinguishable from each other; and 

(c) detecting the amount of binding of the first and 
second labelled complementary nucleic acids to the target 
nucleic acids* 

2. The method of claim 1, wherein the target 
nucleic acids are DNA. 

3. The method of claim 1, wherein the target 
nucleic acids are cDNA . 

4. The method of claim 1, wherein the first and 
second labelled nucleic acids comprise human DNA. 

5. The method of claim 1, wherein the target 
nucleic acids are about 1000 to about 1,000,000 nucleotides in 
complexity. 

6. The method of claim 1, wherein the complexity 
of the sequence complementary to the target nucleic acid 
sequence is less than 1% of the total complexity of the 
collection. 
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7. The method of claim l, wherein the solid 
support is a plurality of beads. 

8. The method of claim 1, wherein the solid 
support is glass. 

9. The method of claim l, wherein the first and 
second labels are fluorescent labels. 

10. The method of claim l, wherein the first and 
second collections of nucleic acids are treated to inhibit the 
binding of repetitive sequences. 

11. The method of claim 10 , wherein the first and 
second collections of nucleic acids are mixed with unlabeled 
blocking nucleic acids comprising repetitive sequences. 

12. The method of claim 11, wherein the unlabeled 
blocking nucleic acids are Cot-l DNA. 

13. The method of claim l, wherein the first 
labeled nucleic acids comprise mRNA or cDMA from a test cell 
and the second labeled nucleic acids comprise mRNA or cDNA 
from a reference cell. 

14. The method of claim l, wherein the first 
labeled nucleic acids are from a test genome and the second 
labeled nucleic acids are from a normal reference genome. 

15. The method of claim 14 , wherein the test genome 
comprises nucleic acids from fetal tissue. 

16. The method of claim 14, wherein the test genome 
comprises nucleic acids from a tumor. 

17. A kit for quantifying nucleic acid sequences in 
a nucleic acid sample, the kit comprising: 
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(a) a solid support having an array of preselected 
target nucleic acids bound thereto where the array has at 
least two members; and 

(b) a container containing reference nucleic acids, 

5 where said reference nucleic acids comprise sequences that are 
complementary and non-complementary to at least one member of 
the array. 



19, The kit of claim 17, wherein the target nucleic 
acids are between about 1,000 and about 1,000,000 nucleotides 



10 



18. The kit of claim 17, wherein the molar ratio of 
complementary and non-complementary nucleic acids is less than 



1:100, 



15 



in complexity. 



20. The kit of claim 17, wherein the kit further 
comprises two different fluorescent labels. 



20 



21. 



The kit of claim 17, wherein the solid support 



is glass. 



22. The kit of claim 17, wherein the reference 
nucleic acids are mammalian are mammalian genomic nucleic 
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acids. 



23. The kit of claim 22, wherein the mammalian 
genomic nucleic acid is of human origin. 



30 
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COMPARATIVE HYBRIDIZATION OF C0L0-320 GENOMIC DNA AND 
NORMAL GENOMIC DNA TO cMYC AND 2ID7 TARGET SEQUENCES 
ON GLASS PARTICLES. 
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